Structural Phrase Alignment Based on Consistency Criteria
نویسندگان
چکیده
In this paper, we propose a new method for phrase alignment using a dependency type distance and a distance-score function. With this method, appropriate correspondences can be selected among correspondence candidates that often include ambiguous or incorrect ones. Furthermore, this method makes it possible to measure the overall alignment consistency. We conduct an alignment experiment using 500 parallel sentences on newspaper domain, and achieve an F-measure improvement of 35 points over the simple statistical method (GIZA++), and 3.0 points over a baseline system. We also conducted a translation experiment and achieved a BLEU score improvement of 0.4 points over a baseline system.
منابع مشابه
Transformation from Discontinuous to Continuous Word Alignment Improves Translation Quality
We present a novel approach to improve word alignment for statistical machine translation (SMT). Conventional word alignment methods allow discontinuous alignment, meaning that a source (or target) word links to several target (or source) words whose positions are discontinuous. However, we cannot extract phrase pairs from this kind of alignments as they break the alignment consistency constrai...
متن کاملKyoto-U: Syntactical EBMT System for NTCIR-7 Patent Translation Task
This paper describes “Kyoto-U” MT system that attended the patent translation task at NTCIR-7. Example-based machine translation is applied in this system to integrate our study on both structural NLP and machine translation. In the alignment step, consistency criteria are applied to solve the alignment ambiguities and to discard incorrect alignment candidates. In the translation step, translat...
متن کاملStatistical Phrase Alignment Model Using Dependency Relation Probability
When aligning very different language pairs, the most important needs are the use of structural information and the capability of generating one-to-many or many-to-many correspondences. In this paper, we propose a novel phrase alignment method which models word or phrase dependency relations in dependency tree structures of source and target languages. The dependency relation model is a kind of...
متن کاملAn iterative refinement algorithm for consistency based multiple structural alignment methods
MOTIVATION Multiple STructural Alignment (MSTA) provides valuable information for solving problems such as fold recognition. The consistency-based approach tries to find conflict-free subsets of alignments from a pre-computed all-to-all Pairwise Alignment Library (PAL). If large proportions of conflicts exist in the library, consistency can be hard to get. On the other hand, multiple structural...
متن کاملDiscriminative Phrase-based Lexicalized Reordering Models using Weighted Reordering Graphs
Lexicalized reordering models play a central role in phrase-based statistical machine translation systems. Starting from the distance-based reordering model, improvements have been made by considering adjacent words in word-based models, adjacent phrases pairs in phrasebased models, and finally, all phrases pairs in a sentence pair in the reordering graphs. However, reordering graphs treat all ...
متن کامل